Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/14947/14

The Influence of Speech Rate on Fujisaki Model Parameters

01-Jan-1970 Research 2015 : January - March

Hansjorg Mixdorff, Adrian Leemann, Volker Dellwo

The current paper examines influences of speech rate on Fujisaki model parameters based on read speech from the\nBonn Tempo-Corpus containing productions by 12 native speakers of German at five different intended tempo levels\n(very slow, slow, normal, fast, fastest possible). The normal condition was produced at an average rate of 6.34 syllables/s\nor 100%, the very slow version at 67%, and the fastest version at 161% of the normal rate. We extracted F0 contours\nand subjected them to decomposition using the Fujisaki model. We ordered all the data with respect to their actual\nspeech rates. First, we assessed how prosodic realizations vary with speech rate and examined phrase command\nmagnitudes, the number of phrase commands as well as the base frequency, accent command amplitudes, and the\ntiming of accent command with respects to the underlying syllables and their nuclear vowels. Second, we analyzed\nbetween-sentence variability within and between speakers and investigated whether and how the prosodic structure is\npreserved at different speech rates. For very slow speech, we found for some of the speakers that the original phrase\nstructure had disintegrated into something like a list of isolated words separated by pauses. Very fast speech became\nchains of uniform syllables at very high pitch and with almost flat intonation. With respect to the F0 range reflected by\nthe amplitude of accent commands, we found strong interspeaker differences. While four of the subjects exhibited a\nsignificant reduction at higher speech rates, the others did not. As speed increases, it appears that F0 gestures\ncommence earlier in the syllable, that is, the onset time of accent commands is located closer to the syllable/vowel\nonset than at lower speed.

How to Cite this Article
CC Compliant Citation: Mixdorff et al.: The influence of speech rate on Fujisaki model parameters. EURASIP Journal on Audio,\nSpeech, and Music Processing 2014 2014:33, doi:10.1186/s13636-014-0033-6, (http://creative commons.org/licenses/by/2.0).
Download Full Text

Call Us: +4 (800) 888-0008

Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/14947/14

The Influence of Speech Rate on Fujisaki Model Parameters

How to Cite this Article

Links

Contact Us